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Abstract. All physical systems in equilibrium obey the laws of thermody- 
namics. In other words, whatever the precise nature of the interaction be- 
tween the atoms and molecules at the microscopic level, at the macroscopic 
level, physical systems exhibit universal behavior in the sense that they are 
all governed by the same laws and formulae of thermodynamics. In this paper 
we describe some recent history of universality ideas in physics starting with 
Wigner's model for the scattering of neutrons off large nuclei and show how 
these ideas have led mathematicians to investigate universal behavior for a va- 
riety of mathematical systems. This is true not only for systems which have a 
physical origin, but also for systems which arise in a purely mathematical con- 
text such as the Riemann hypothesis, and a version of the card game solitaire 
called patience sorting. 



1. Introduction 

All physical systems in equilibrium obey the laws of thermodynamics. The first 
law asserts the conservation of energy. The second law has a variety of formulations, 
one of which is the following: Suppose that in a work cycle a heat engine extracts 
Qi units of heat from a heat reservoir at temperature Ti, performs W units of 
work, and then exhausts the remaining Q2 = Qi — W units of heat to a heat 
sink at temperature T2 < Ti. Let V — ^ denote the efficiency of the conversion 
of heat into work. Then the second law tells us there is a maximal efficiency 
?7max = {Ti — T2)/Ti, depending only on Ti and T2, so that for all heat engines, 
and all work cycles, 

V<VmaK- (1) 

Nature is so set up that we just cannot do any better. 

On the other hand, it is a very old thought, going back at least to Democritus 
and the Greeks, that matter, all matter, is built out of tiny constituents - atoms 
- obeying their own laws of interaction. The juxtaposition of these two points 
of view, the macroscopic world of tangible objects and the microscopic world of 
atoms, presents a fundamental, difficult and long-standing challenge to scientists; 
namely, how does one derive the macroscopic laws of thermodynamics from the 
microscopic laws of atoms? The special, salient feature of this challenge is that 
the same laws of thermodynamics should emerge no matter what the details of 
the atomic interaction. In other words, on the macroscopic scale, physical systems 
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should exhibit universaUty^. Indeed, it is the very emergence of universal behavior 
for macroscopic systems that makes possible the existence of physical laws. 

This kind of thinking, however, is not common in the world of mathematics. 
Mathematicians tend to think of their problems as sui generis, each with its own 
special, distinguishing features. Two problems are regarded as "the same" only 
if some isomorphism, explicit or otherwise, can be constructed between them. In 
recent years, however, universality in the above sense of macroscopic physics has 
started to emerge in a wide variety of mathematical problems, and the goal of 
this paper is to illustrate some of these developments. As we will see, there are 
problems from diverse areas, often with no discernible, mechanistic connections, all 
of which behave, on the appropriate scale, in precisely the same way. The list of 
such problems is varied, long and growing, and points to the emergence of what 
one might call "macroscopic mathematics." 

A precedent for the kind of results that we are going to describe is given by the 
celebrated central limit theorem of probability theory, where one considers inde- 
pendent, identically distributed variables {x„}„>i. The central limit theorem tells 
us that if we center and scale the variables, Xn ^ Vn = {xn ^ IExp(x„)) / ^Jya,^:{xn), 
then 



lim Prob ^''tI" <t] ^ e~~ —= . (2) 
V V"- / J-oo V27r 

We see here explicitly that the Gaussian distribution on the right-hand side of Q 
is universal, independent of the distribution for the x„'s. The proof of the central 
limit theorem for independent coin flips, Prob(a;„ = +1) = Prob(a;„ = —1) = ^, 
goes back to de Moivre and Laplace in the 18'^ century. 

The outline of the paper is as follows: In Section |21 we will introduce and discuss 
some models from random matrix theory (RMT). Various distributions associated 
with these models will play the same role in the problems that we discuss later 
on in the paper as the Gaussian does in As noted above, thermodynamics re- 
flects universality for all macroscopic systems, but there are also many universality 
sub-classes which describe the behavior of physical systems in restricted situations. 
For example, many fluids, such as water and vinegar, obey the Navier-Stokes equa- 
tion, but a variety of heavy oils obey the lubrication equations. In the same way 
we will see that certain mathematical problems are described by so-called Unitary 
Ensembles of random matrices, and others by so-called Orthogonal or Symplectic 
Ensembles. In Section O we present a variety of problems from different areas 
of mathematics, and in Section 0] we show how these problems are described by 
random matrix models from Section El In the final Section El we discuss briefly 
some of the mathematical methods that are used to prove the results in Sectional 
Here combinatorial identities, Riemann-Hilbert problems (RHP's) and the nonlin- 
ear steepest descent method of |DeiZho| . as well as the classical steepest descent 
method, play a key role. We end the section with some speculations, suggesting 
how to place the results of Sections and 01 in a broader mathematical framework. 



In physics, the term "universaUty" is usually used in the more limited context of scaling laws 
for critical phenomena. In this paper we use the term "universality" more broadly in the spirit of 
the preceding discussion. We trust this will cause no confusion. 



UNIVERSALITY FOR MATHEMATICAL AND PHYSICAL SYSTEMS 



3 



2. Random matrix models 

There are many ensembles of random matrices that are of interest, and we refer 
the reader to the classic text of Mehta |Meh| for more information (see also |Deilp . 
In this paper we will consider almost exclusively (see, however, H54|l et seq. below) 
only three kinds of ensembles: 

(a) Orthogonal Ensembles (OE's) consisting of TV x real symmetric matrices 
M, M = M = M^. 

(b) Unitary Ensembles (UE's) consisting oi N x N Hermitian matrices M, 
M = M*. 

(c) Symplectic Ensembles (SE's) consisting of 2N x 2N Hermitian, self-dual 
matrices M = M* = JM'^j'^, where J is the standard 2N x 2N block 

diagonal symplectic matrix, J = diag(T, t, . . . , t), t — 

For reasons that will soon become clear, OE's, UE's and SE's are labeled by a 
parameter /?, where /5 = 1, 2 or 4, respectively. In all three cases the ensembles are 
equipped with probability distributions of the form 

where Vn,i3 is a real- valued function on M such that Vat,/? (a;) — > -l-oo sufficiently 
rapidly as \x\ oo, Z^^^p is a normalization coefficient, and dpM denotes Lebesgue 
measure on the algebraically independent entries of M . For example, in the or- 
thogonal case, dp^iM = I\i<j<k<N ^^jki where M = {Mjk) (see, e.g. |Mehj V 
The notation "orthogonal", "unitary", and "symplectic" refers to the fact that 
the above ensembles with associated distributions Q are invariant under conju- 
gation M — > SMS~^, where S is orthogonal, unitary, or unitary-symplectic (i.e., 
S e USp{2N) = {S : SS* = /, SJS'^ = J}) respectively. When VnA^) = 
one has the so-called Gaussian Orthogonal Ensemble (GOE), the Gaussian Unitary 
Ensemble (GUE), and the Gaussian Symplectic Ensemble (GSE), for /? = 1, 2 or 4, 
respectively. 

The distributions give rise in turn to distributions on the eigenvalues Ai < 
A2 < • • • of M 

Piv,/3(A)d^A = ^e-''^5:tiV„,,(A,) -Q |A,-A,f dAi-.-dAAT (4) 

l<t<j<N 

where Zn,i3 is again a normalization coefficient, and 7//3 = 1 if /3 = 1 or 2 and 
774 = 2 (this is because the eigenvalues for (3 = A double up). The labeling of OE's, 
UE's, and SE's by /3 = 1, 2 and 4 is now clear. In all three cases, we see that the 
random matrix ensembles give rise to random particle systems {Ai, A2, . . .} with 
repulsion built in: the probability that two eigenvalues are close together is small 
and vanishes like a power of the distance between them. This is an essential feature 
of random matrix ensembles, in contrast to random Poisson particle systems, say, 
where the particles may bunch together or exhibit large gaps. 

Loosely speaking, we say that a system is modeled by random matrix theory 
(RMT) if it behaves statistically like the eigenvalues of a "large" OE, UE,... matrix. 
In analyzing such systems there is something known as the standard procedure: 
Suppose we wish to compare some statistical quantities {au} in the neighborhood 
of some point A with the eigenvalues {Afc} of some matrix in the neighborhood of 
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some energy E, say, in the bulk of the spectrum. Then we always center and scale 
the afc's and the Afc's, 

flfe ^ Ofc = 7a(afe - Afe ^ Afc = 7A(Afe - (5) 

so that 

Exp(#{ai:'s per unit interval}) = Exp(#{Afc's per unit interval}) = 1. (6) 

For energies E at the edge of the spectrum, the above procedure must be modified 
slightly (see below). 

This procedure can be viewed as follows: A scientist wishes to investigate some 
statistical phenomenon. What s'he has at hand is a microscope and a handbook of 
matrix ensembles. The data {a^} are embedded on a slide which can be inserted 
into the microscope. The only freedom that the scientist has is to center the slide, 
a-k ^ cLk — -A, and then adjust the focus au — A h]. = jaio,k — A) so that on 
average one data point dk appears per unit length on the slide. At that point the 
scientist takes out his'r handbook, and then tries to match the statistics of the 5,^ 's 
with those of the eigenvalues of some ensemble. If the fit is good, the scientist then 
says that the system is well-modeled by RMT. 

It is a remarkable fact, going back to the work of Gaudin and Mehta, and later 
Dyson, in the 1960's, that the key statistics for OE's, UE's, and SE's can be com- 
puted in closed form. This is true not only for finite N, but also for various scaling 
limits as ^ oo. For GOE, GUE, and GSE we refer the reader to jMeh| . Here 
the Hermite polynomials, which are orthogonal with respect to the weight dx 
on K, play a critical role, and the scaling limits as A^ — > oo follow from the known, 
classical asymptotics of the Hermite polynomials. For UE's with general poten- 
tials Vn^i3^2, the techniques described in [Mehj for GUE go through for finite A', 
the role of the Hermite polynomials now being played by the polynomials orthog- 
onal with respect to the weight e~^™''*=^(^) dx on K (see, e.g. |Deilj '). For general 
Vn.i3=2, however, the asymptotic behavior of these polynomials as A^ ^ cxd does 
not follow from classical estimates. In order to overcome this obstacle, the au- 
thors in [DKMVZI and DKM V22) (see also jDeil| for a pedagogical presentation) 
used the Riemann-Hilbert steepest-descent method introduced by Deift and Zhou 
[DeiZhoj . and further developed with Venakides |DVZ| . to compute the asymp- 
totics as A' ^ oo of the orthogonal polynomials for a very general class of analytic 
weights. In view of the preceding comments, the scaling limits of the key sta- 
tistics for UE's then follow for such weights (see also |BleIts| for the special case 
Vn,p=2{x) = N[x^ ~ tx^)). For another approach to UE universality, see [PasSchj . 
For OE's and SE's with classical weights, such as Laguerre, Jacobi, etc., for which 
the asymptotics of the associated orthogonal polynomials are known, the GOE and 
GSE methods in |Meh| apply (see the introductions to [DeiGiol] and |DeiGio2] for 
a historical discussion). For general Vn,i3, (3 — 1 oi A, new techniques are needed, 
and these were introduced, for finite A^, by Tracy and Widom in ITraWidJI and 
|Wid| . In jPeiGiol] and |DeiGio2] . the authors use the results in TraWid2 and 
|Wid| . together with the asymptotic estimates in |DKMVZ2] . to compute the large 
N limits of the key statistics for OE's and SE's with general polynomial weights 

Vn,0{x) = H2ynX^"^ ^ , l^2m > 0. 

It turns out that not only can the statistics for OE's, UE's and SE's be computed 
explicitly, but in the large A^ limit the behavior of these systems is universal in the 
sense described above, as conjectured earlier by Dyson, Mehta, Wigner, and many 
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others. It works like this: Consider N x N matrices M in a UE with potential Vn,2- 
Let Kn denote the finite rank operator with kernel 

Af-l 

KN{x,y) = ipj{x)ipj{y), x,y&R (7) 
i=o 

where 

^,(x)=p,(x)e-5^«'^(-), j>0 (8) 

and 

p,{x) =-f,x' + ■■■ , j>0, 7j>0 (9) 
are the orthonormal polynomials with respect to the weig ht e-^"'2("=) dx, 

p, (a;K(x)e-^".-(") dx = S,k, j, k>0. 

Then the m-point correlation functions 

_ iV! f f - 

-Rm(Al, . . . , Am) = — — /• • •/ Pn, 2(^1, ^n) dXm+1 ■■■ dXN 



{N -my. 

can be expressed in terms of Kn as follows: 

i?„,(Ai,...,A„) = det(XAr(A„A,))^<^^^.<,^. (10) 

A simple computation for the 1-point and 2-point functions, i?i(A) and i?2(Ai, A2), 
shows that 

Exp(#{A, eB})= f i?i(A) dX (11) 

JB 

for any Borel set B cR, and 
Exp(#{orderedpairs(i,j), j : (A,,Aj)gA})= J J i?2(Ai, A2) dAidAj (12) 

for any Borel set A C M^. 

It follows in particular from Hll|l that, for an energy E, Ri{E) — Kpf{E,E) is 
the density of the expected number of eigenvalues in a neighborhood of E, and 
hence, by the standard procedure, one should take the scaling factor 7^ in ||SJ) to 
be Kpf{E,E). For energies E in the bulk of the spectrum, one finds for a broad 
class of potentials Vn^2 (see |DKMVZ1| and |DKMVZ2| ) that, in the scaling limit 
dictated by Kn{E,E), Kn{X,X') takes on a universal form 

where x,y £W and Koo is the so-called sine-kernel, 

KUu) = (14) 
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Inserting this information into H10|l wc see that the scahng hniit for Rm is universal 
for each m > 2, and in particular for 171 — 2, we have for x, y G K 

lim ^i?2 Lb + ————,£; ' ^ 



^-^^ {KNiE,E)Y^^^\^ ' Kn{E,E)'^ ' Kn{E,E) 

f ifoo(O) K^{x-y)\ .-..x 
^''Uoo(^-y) ifoo(O) ) ^^^> 

sin7r(a; — y) 
7r(x - y) 

For a Borel set i? C M, let = #{^j ■ ^ B} and let 

Vb = Exp{nB - Exp(nB))' (16) 
denote the number variance in B. A simple computation again shows that 

Vb — Ri{x)dx+ / / R2{x,y)dxdy— I / Ri{x)dx 

Jb J JBxB \JB 

For an energy E in the bulk of the spectrum as above, set 



For such _B, Vb is the number variance for an interval about E of scaled size s. 
Recalling that Kjq {E, E) ~ Ri (E) , and using ((T3|l , we find as oo 

1 r'^''' 1 -cosu . 2s f°° /sinw^ ^ 



lim Vb,(^s) = — / du+- / ] du. (17) 

For large s, the right-hand side has the form (see jMeh| ') 



^(log(2^s)+7+l)+o(i) (18) 



where 7 is Euler's constant. 

For > 0, the so-called gap probability 

GnM^) = Prob(M : M has no eigenvalues in (£; - 6*, + 0)) (19) 

is given by (see |Mehj . and also IDeill) 

Gjv,2(^) =clet(l~XwfL2 {E-e,E+0) ) (20) 

where AT^r \ L^{E-e,E+e) denotes the operator with kernel {T)) acting on L^{E — 9,E + 
6). In the bulk scaling limit, we find 

lim Gn,2{ .^ ,"1 j^. ] =dct{l-Koo\mE-e.E+e)), xeR. (21) 

In terms of the scaled eigenvalues Xj — Kn{E,E) ■ (Xj — E), this means that for 
a; > 

lim Prob(M : ^ {~x,x), 1 < j < N) ^ det(l - K^Il^^^,,,-)). (22) 

Now consider a point E, say E — 0, where Kn{E,E) = Kn{0,0) — > 00 as 
N 00. This is true, in particular, if 

Vn,2{x) = KmX^"" + ■ ■ ■ , Km>0,m>l, (23) 
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and so for Vn,2{x) — x"^ (GUE). For such Vjv,2's, we have Kn{Q,Q) ~ N^~^ (see 
inEMYZD)- Let tjv > be such that 

Then 

jV^Exp # |A,| < .7, nW = / * ifjv(A,A)dA^2tAr^«.. (25) 
For a <b, define the Borel set Ajv C by 



Then we have by ((T^ and lfT3|) . as iV ^ cx), 

-^Exp(#{ordered pairs («, j), i ^ j ■ (A^, Aj) £ Ajv}) 



4 // i?2(Ai,A2)dAidA2 



R2 _ , _ dsdt 



A^77{(s,t):a<s-t<b,|s|,|t|<t„} (ifjv(0,0))^ V^A^(0>0)' ii:Ar(0,0) 

iff 1 /!ifi^(£_Of|<,,,, 

"'{(s,t):a<s-t<b, |s|,|t|<tjv} V ^ 7r(S - tj 

TV 



1_ (m^)' ) 



/ sm nr \ 



1- (—^) ) dr, by 
Thus, if \j = ^^(0, 0)Aj are, again, the scaled eigenvalues, then for as in H24|l 
lim — Exp(#{ordered pairs i ^ j '■ a < A.^ — A,- < b, \Xi\, |A,| < t^}) 



(27) 

Another quantity of interest is the spacing distribution of the eigenvalues Ai < 
A2 < • • ■ < Aat of a random N x N matrix as N ^ 00. More precisely, for s > 0, 
we want to compute 



Exp 



#{1<J<A^-1 : Xj+i-Xj<s} 



N 



as N becomes large. If we again restrict our attention to eigenvalues in a neighbor- 
hood of a bulk energy E = 0, say, then the eigenvalue spacing distribution exhibits 
universal behavior for UE's as TV ^ 00. We have in particular the following result 
of Gaudin (see [Meh| . and also |Deil| ): With In, N and Xj — Kn{0, 0)xj as above, 
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lim Exp 



#{1 < J < - 1 : A,+i - Aj < s, |A,| < t^} 



N^oo \ N I 

= lim Prob(at least one eigenvalue Xj in (0, s] \ eigenvalue at 0) (28) 

N^oc 
s 

p{u) du 

Jo 
where 

p{u) = ^ (det(l - i^oorL^(o,«))) • (29) 

At the upper spectral edge E = Amax, one again finds universal behavior for UE's 
with potentials Vn.2, in particular, of the form H23|l above. For such Vn.2's> there 
exist constants s^' such that for t e K (see (DeiGio2| and the notes therein) 

^Ihn^Prob M : """(^^ ^ < t U det(l -A\mt,oo))- (30) 



N 



Here A is the so-called Airy operator with kernel 

X Ai{x)Ai'{y) - Ai'{x)Ai{y) 
A{x,y) = , 31 

x-y 

where Ai{x) is the classical Airy function. For GUE, where Vn,2{x) = x'^, one has 
z^"* — \/2N and s|^^ = 2~^7V~3 (see Forrester |Forl| and the seminal work of 
Tracy and Widom |TraWidl| ). 

It turns out that det(l — Kao\L^{-x.x)) in Pll> and det(l — A\L2i^f.,^-^^ can be 
expressed in terms of solutions of the Painleve V and Painleve II equations respec- 
tively. The first is a celebrated result of Jimbo, Miwa, Mori, and Sato 



and the second is an equally celebrated result of Tracy and Widom [TraWidl] . In 
particular for edge scaling we find 

lim Prob (m : ^^--'/n < A ^ ^ ^33) 

where 

Fp^2{t)^dei{l-A\mt,oo))^e--^r(^-^>'(^)ds (33) 

and u{s) is the (unique, global) Hastings-McLeod solution of the Painleve II equa- 
tion 

u"{s)^2v?'{s) + su{s) (34) 

such that 

u{s) ^ Ai[s) as s ^ +00. (35) 

F2{t) = Fi3=2{t) is called the Tracy- Widom distribution for /? = 2. 

Finally we note that for OE's and SE's there are analogs for all the above results 
H10|l - H35|) . and again one finds universality in the scaling limits as TV ^ 00 for 
potentials Vw,/?, /3 = 1,4, of the form above (see [DeiGiolj and |DeiGio2j and 
the historical notes therein). We note, in particular, the following results: for Vn,i3 
as above, f3 — 1 01 A, there exist constants z'}^\ sl^-* such that 

lim Prob I M : '^"""^^fi" ^^^^ < t] = Fp{t) (36) 
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where 

= (F2(t))^e~^'''*°° (37) 

and 

F,{t) = {F^{t)Y ±^ (38) 

with F2{t) and u{s) as above. Fi{t) and Fn{t) are called the Tracy-Widom distri- 
butions for f3 ~ I and 4 respectively. 

3. The problems 

In this section we consider seven problems. The first is from physics and is in- 
cluded for historical reasons that will become clear in S ect ion 0] below; the remaining 
six problems are from mathematics/mathematical physics. 

Problem 1 

Consider the scattering of neutrons off a heavy nucleus, say uranium U^"^^. The 
scattering cross-section is plotted as a function of the energy E of the incoming 



neutrons, and one obtains a jagged graph (see IPorl and jMehj ) with many hundreds 
of sharp peaks Ei < E2 < ■ ■ ■ and valleys E[ < E2 < ■ ■ ■ . If E ^ Ej for some 
j, the neutron is strongly repelled from the nucleus, and if _E '--^ E'^ for some j, 
then the neutron sails through the nucleus, essentially unimpeded. The Ej^s are 
called scattering resonances. The challenge faced by physicists in the late 40's and 
early 50's was to develop an effective model to describe these resonances. One 
could of course write down a Schrodinger-type equation for the scattering system, 
but because of the high dimensionality of the problem there is clearly no hope 
of solving the equation for the Ej^s either analytically or numerically. However, 
as more experiments were done on heavy nuclei, each with hundreds of Ej's, a 
consensus began to emerge that the "correct" theory of resonances was statistical, 
and here Wigner led the way. Any effective theory would have to incorporate two 
essential features present in the data, viz. 

(i) modulo certain natural symmetry considerations, all nuclei in the same 
symmetry class exhibited universal behavior 

(ii) in all cases, the Ej^s exhibited repulsion, or, more precisely, the probability 
that two Ej's would be close together was small. 

Question 1. What theory did Wigner propose for the -Ej 's? 
Problem 2 

Here we consider the work of H. Montgomery |Monj in the early 1970's on the 
zeros of the Riemann zeta function C(s)- Assuming the Riemann hypothesis, Mont- 
gomery rescaled the imaginary parts 71 < 72 < • • • of the (nontrivial) zeros {^+ijj} 
ofC(s), 

7.^7. = ^ (39) 
to have mean spacing 1 as T — s- 00, i.e. 

hm . 1. 

For any a < b, he then computed the two-point correlation function for the jj 's 
#{ordered pairs (ji,j2),ii 7^ j2 ■ 1 < ji, j2 < A^,7ji - % ^ (a.^)} 
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and showed, modulo certain technical restrictions, that 

R{a,b)= lim ^#{ordered pairs (ji, ja), ji 7^ ^2 : 1 < Ji, ^2 < ^, 7ji-7j2 ^ (a, 6)} 

(40) 

exists and is given by a certain explicit formula. 

Question 2. What formula did Montgomery obtain for R{a,b)7 

Problem 3 

Consider the solitaire card game known as patience sorting (see [AldPiaj and 
|Mal| ) . The game is played with N cards, numbered l,2,...,iV for convenience. The 
deck is shuffled and the first card is placed face up on the table in front of the 
dealer. If the next card is smaller than the card on the table, it is placed face up 
on top of the card; if it is bigger, the card is placed face up to the right of the first 
card, making a new pile. If the third card in the pile is smaller than one of the 
cards on the table, it is placed on top of that card; if it is smaller than both cards, 
it is placed as far to the left as possible. If it is bigger than than both cards, it 
is placed face up to the right of the pile(s), making a new pile. One continues in 
this fashion until all the cards are dealt out. Let qn denote the number of piles 
obtained. Clearly depends on the particular shuffle tt S Sn, the symmetric 
group on N numbers, and we write qn = '?a'(7I'). 

For example, if = 6 and tt = 34156 2, where 3 is the top card, 4 is the next 
card and so on, then patience sorting proceeds as follows: 

11 1 12 

3 34 34 345 3456 3456 

and qeiTr) — 4. 

Question 3. Equip Sn with the uniform distribution. If each card is of unit size, 
how big a table does one typically need to play patience sorting with N cards? Or, 
more precisely, how does 

Pn,N = Prob(7r : (7jv(7r) < n) (41) 
behave as oo, n < Nl 

Problem 4 

The city of Cuernavaca in Mexico (population about 500,000) has an extensive 
bus system, but there is no municipal transit authority to control the city transport. 
In particular there is no timetable, which gives rise to Poisson-like phenomena, with 
bunching and long waits between buses. Typically, the buses are owned by drivers 
as individual entrepreneurs, and all too often a bus arrives at a stop just as another 
bus is loading up. The driver then has to move on to the next stop to find his fares. 
In order to remedy the situation the drivers in Cuernavaca came up with a novel 
solution: they introduced "recorders" at specific locations along the bus routes in 
the city. The recorders kept track of when buses passed their locations, and then 
sold this information to the next driver, who could then speed up or slow down in 
order to optimize the distance to the preceding bus. The upshot of this ingenious 
scheme is that the drivers do not lose out on fares and the citizens of Cuernavaca 
now have a reliable and regular bus service. In the late 1990's two Czech physicists 
with interest in transportation problems, M. Krbalek and P. Seba, heard about 
the buses in Cuernavaca and went down to Mexico to investigate. For about a 
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month they studied the statistics of bus arrivals on Line 4 close to the city center. 
In particular, they studied the bus spacing distribution, and also the bus number 
variance measuring the fluctuations of the total number of buses arriving at a fixed 
location during a time interval T. Their findings are reported in KrbSeb,. 

Question 4. What did Krbalek and Seba learn about the statistics of the bus 
system in Cuernavaca? 

Problem 5 

In his investigation of wetting and melting phenomena in Pis , Fisher introduced 
various "vicious" walker models. Here we will consider the so-called random turns 
vicious walker model. In this model, the walks take place on the integer lattice 
Z and initially the walkers are located at 0, 1, 2, ... . The rules for a walk are as 
follows: 

(a) at each tick of the clock, precisely one walker makes a step to the left 

(b) no two walkers can occupy the same site (hence "vicious walkers" ) 

For example, consider the following walk from time i = to time t = A: At i = 0, 

X -xx-xx t=4 

X -x-xxx t = 3 

xx-xxx t = 2 

x-xxxx t = 1 

xxxxx t = 

-4-3-2-101234 

Figure 1 . Random turns walk. 

clearly only the walker at can move. At time t — 1, either the walker at -1 or at 
+1 can move, and so on. Let ^at be the distance traveled by the walker starting 
from 0. In the above example, d^ ~ 2. For any time N, there are clearly only a 
finite number of possible walks of duration t = N. Suppose that all such walks are 
equally likely. 

Question 5. How does dN behave statistically as iV — > oo? 
Problem 6 

Consider tilings {T} of the tilted square T„ — {{x,y) : \x\ + \y\ < n + 1} in 
by horizontal and vertical dominos of length 2 and width 1. For example, for 
rt = 3 we have the tiling T of Figure|21 For each tiling the dominos must lie strictly 
within T„. The tilings T are called Aztec diamonds because the boundary of T 
in {{x,y) : y > 0}, say, has the shape of a Mexican pyramid. It is a nontrivial 
theorem (see |EKLPp that for any n, the number of domino tilings of T„ is 2*2^' . 
Assume that all tilings are equally likely. 
Question 6. What does a typical tiling look like as n ^ oo? 

Finally we have 

Problem 7 

How long does it take to board an airplane? We consider the random boarding 
strategy in BBSSS under the following simplifying assumptions: 
(a) there is only 1 seat per row 
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Figure 2. Aztec diamond for n — 3. 

(b) the passengers are very thin compared to the distance between seats 

(c) the passengers move very quickly between seats. The main delay in board- 
ing is the time - one unit - that it takes for the passengers to organize their 
luggage and seat themselves once they arrive at their assigned seats. 

For the full problem with more than one seat per row, passengers who are not 
"very thin", etc., see |BBSSS| . and also the discussion of the boarding problem in 
Section 21 below. 

The passengers enter the airplane through a door in front and the seats are 
numbered 1,2,..., N, with seat 1 closest to the door. How does boarding proceed? 
Consider, for example, the case iV = 6. There are 6 passengers, each with a seating 
card 1,2,...,6. At the call to board, the passengers line up randomly at the gate. 
Suppose for definitiveness that the order in the line is given by 

tt: 341562 (42) 

with 3 nearest the gate. Now 3 can proceed to his'r seat, but 4 is blocked and must 
wait behind 3 while s'he puts his'r bag up into the overhead bin. However, at the 
same time, 1 can proceed to his'r seat, but 5,6, and 2 are blocked. At the end of 
one unit of time, 3 and 1 sit down, and 4 and 2 can proceed to their seats, but 5 
and 6 are blocked behind 4. After one more unit of time 4 and 2 sit down, and 5 
can proceed to his'r seat, but 6 is blocked. At the end of one more unit of time 5 
sits down, and finally 6 can move to his'r seat. Thus for tt as above, it takes 4 units 
of time to board. Let = foAr(7r) denote the boarding time for any tt G Sn, and 
assume that the tt's are uniformly distributed. 
Question 7. How does &Ar(7r) behave statistically as ^ oo? 

4. Solutions and explanations 

As indicated in the Introduction, the remarkable fact of the matter is that all 
seven problems in Section |2| are modeled by RMT. 

Problem 1. (neutron scattering) 

At some point in the mid-1950's, in a striking development, Wigner suggested 
that the statistics of the neutron scattering resonances was governed by GOE^ 
(and hence, by universality jPeiGiol] . by all OE's). And indeed, if one scales real 



Here we must restrict the data to scattering for situations where the nuclear forces are time- 
reversal invariant. If not, the statistics of the scattering resonances should be governed by GUE. 
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scattering data for a variety of nuclei according to the standard procedure and then 
evaluates, in particular, the nearest neighbor distribution, one finds remarkable 
agreement with the OE analog of the spacing distribution (|28f) . H29|) . 

It is interesting, and informative, to trace the development of ideas that led 
Wigner to his suggestion (see | Wigl| , | Wig2| , |Wig3| ; all three papers are repro- 
duced in [Porj ) . In these papers, Wigner is guided by the fact that any model for 
the resonances would have to satisfy the constraints of universality and repulsion, 
(i) and (ii) respectively, in the description of Problem 1. In |Wig2| he recalls a 
paper that he had written with von Neumann in 1929 in which they showed, in 
particular, that in the "("^+^) -dimensional space of real n x n symmetric matrices, 
the matrices with double eigenvalues form a set of codimension 2. For example, if a 
2x2 real symmetric matrix has double eigenvalues, then it must be a multiple of the 
identity and hence it lies in a set of dimension 1 in K.-^ . It follows that if one equips 
the space of real, symmetric matrices with a probability measure with a smooth 
density, the probability of a matrix M having equal eigenvalues would be zero and 
the eigenvalues Ai, . . . , A„ of M would comprise a random set with repulsion built 
in. So Wigner had a model, or more precisely, a class of models, which satisfied 
constraint (ii). But why choose GOE? This is where the universality constraint (i) 
comes into play. We quote from |Wig3p : "Let me say only one more word. It is 
very likely that the curve in Figure 1 is a universal function. In other words, it 
doesn't depend on the details of the model with which you are working. There is 
one particular model in which the probability of the energy levels can be written 
down exactly. I mentioned this distribution already in Gatlinburg. It is called the 
Wishart distribution. Consider a set... ." So in this way Wigner introduced GOE 
into theoretical physics: It provided a model with repulsion (and time-reversal) 
built in. Furthermore, the energy level distribution could be computed explicitly. 
By universality, it should do the trick! 

As remarkable as these developments were, even the most prophetic observer 
could not have predicted that, a few years down the line, these developments would 
make themselves felt within pure mathematics. 

Problem 2. (Riemann zeta function) 

Soon after completing his work on the scaling limit (|40|l of the two-point cor- 
relation function for the zeros of zeta, Montgomery was visiting the Institute for 
Advanced Study in Princeton and it was suggested that he show his result to Dyson. 
What happened is a celebrated, and oft repeated, story in the lore of the Institute: 
before Montgomery could describe his hard won result to Dyson, Dyson took out 
a pen, wrote down a formula, and asked Montgomery "And did you get this?" 



Montgomery was stunned: this was exactly the formula he had obtained. Dyson 
explained: "If the zeros of the zeta function behaved like the eigenvalues of a random 
GUE matrix, then H43|l would be exactly the formula for the two-point correlation 
function!" (See ^ above.) 



In the quotation that follows, "Figure 1" portrays a level spacing distribution, the "Wishart 
distribution" is the statisticians' name for GOE, and "Gatlinburg" is ,Wigl|. 




(43) 
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More precisely, what Montgomery actually proved was that 

l<i^j<N JK \ / 

for any rapidly decaying function / whose Fourier transform /(^) is supported in 
the interval |^| < 2. Of course, if one could prove (|44|l for all smooth, rapidly 
decaying functions, one would recover the full result 1)43(1 . Nevertheless, in an 
impressive series of numerical computations starting in the 1980's, Odlyzko verified 
(I43|l to extraordinary accuracy (see |Odll| . |0dl2| . and the references therein). In 
his computations, Odlyzko also considered GUE behavior for other statistics for the 
7j's, such as the nearest neighbor spacing, verifying in particular the relationship 

lim < J < - 1 : 7j+i - Ij < s} ^ f p{u) du, s > 0, (45) 

(cf (|2H1), 123) to high accuracy. 

The relationship between the zeros of the zeta function and random matrix 
theory first discovered by Montgomery has been taken up with great virtuosity by 
many researchers in analytic number theory, with Rudnick and Sarnak |RudSar| . 
and then Katz and Sarnak KatSar", leading the way. GUE behavior for the zeros 
of quite general automorphic L-functions over Q, as well as for a wide class of zeta 
and L-functions over finite fields, has now been established (modulo technicalities 
as in H44|l above in the number field case). Another major development has been the 
discovery of a relationship between random polynomials whose roots are given by 
the eigenvalues of a matrix from some random ensemble, and the moments of the L- 
functions on the critical line Rez = ^ (see [KeaSnal] . IKeaSna^) !. The discovery of 
Montgomery/Odlyzko counts as one of the major developments in analytic number 
theory in many, many years. 

Problem 3. (Patience sorting) 

In 1999 Baik, Deift and Johansson |BDJ1| proved the following result for (7Ar(7r), 
the number of piles obtained in patience sorting starting from a shuffle tt of A'^ 
cards. Let xn = Then 

lim Prob(xAr <t) = F2{t) (46) 

N—^oo 

where F2 is the Tracy- Widom distribution (|32|l . (|33|l for (3 — 2. Thus the number of 
piles, suitably centered and scaled, behaves statistically like the largest eigenvalue 
of a GUE matrix. In addition, the authors proved convergence of moments. For 
any m — 1,2,..., 

lim Exp(x:^) = IExp(x") (47) 

N — *OD 

where x is any random variable with distribution F2. In particular, for to = 1,2 
one obtains 

Exp(gAr) - 2ViV 

N^oc 

and 

Var(g Af) 



Jim = / tdF^it) (48) 



jim_ — ^ - / dF2{t) -( I tdF2{t)] . (49) 
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Numerical evaluation shows that the constants on the right-hand side of (|48|1 and 
(gnj are given by -1.7711 and 0.8132, respectively. Thus, as iV —> oo, 

Exp(qAr) - 2^fN - 1.7711 • N^/^ 

so that for a deck of = 52 cards, one needs a table of size about 12 units on 
average to play the game. 

Patience sorting is closely related to the problem of longest increasing subse- 
quences for permutations tt e Sn- Recall that we say that 7r(ii), . . . ,7r(ifc) is an 
increasing subsequence in tt of length k li ii < i2 < ■ ■ ■ < ik and 7r(ii) < 7r(i2) < 
• • • < irlik)- Let In{t^) be the length of the longest increasing subsequence in tt. 
For example, if = 6 and tt = 341562 we see that 3 4 5 6 is a longest increas- 
ing subsequence for tt and hence le{Tr) = 4. Comparing with the introduction to 
Question 3, we see that leiT^) = (76 (tt). This is no accident: for any tt €i S]\[ , we 
always have In{t^) = QnM (see, e.g. [AldPia p. and hence we learn from that 
the length In of the longest increasing subsequence behaves statistically like the 
largest eigenvalue of a GUE matrix as A'^ ^ oo. The relation /Ar(7r) = QNiT^) and 
()48|l imply in particular that 

Hni 5^Pg^ = 2. (50) 

The claim that the limit in H50fl exists, and equals 2, is known as "Ulam's problem" 
and has a long history (see |BDJlj ). In another direction, uniform distribution on 
Sn pushes forward under the Robinson-Schensted correspondence (see, e.g. |Sag| ) 
to so-called Plancherel measure on Young diagrams of size A^. Young diagrams are 
parameterized by partitions /i h A^, {/i = (/ii, /j,2, ■ ■ ■ , ^-i) ■ A'l > > • • • > A*i > 
1, X^i^i Mi ~ where /i^ is the number of boxes in the i'^ row, and it turns out 
that under the correspondence we have 

Prob(7r : /Ar(7r) < n) = Prob(^ h A^ : < n). (51) 

Consequently, the number of boxes in the first row of Plancherel-random Young 
diagrams behaves statistically, as A^ ^ cx), like the largest eigenvalue of a GUE 
matrix. In (BDJT, the authors conjectured that the number of boxes in the first 
k rows of a Young diagram should behave statistically as A^ ^ cx) like the top 
k eigenvalues Xn > Aat-i > • • • > Xn-m+i of a GUE matrix. This conjecture 
was proved for the 2"*^ row in |BDJ2j . For general k, the conjecture was proved, 
with convergence in joint distribution, in three separate papers in rapid succession 
f |Oko| . |BOO| . |Johlj ). using very different methods. The proof in |Okoj relies on 
an interplay between maps on surfaces and ramified coverings of the sphere; the 
proof in |BOO| is based on the analysis of specific characters on S{oo), the infinite 
symmetric group defined as the inductive limit of the finite symmetric groups Sn 
under the embeddings Sn ^ Sn+1i and the proof in [Johl, utilizes certain discrete 
orthogonal polynomial ensembles arising in combinatorial probability. 

One can consider the statistics of In{t^) for tt restricted to certain distinguished 
subsets of Sn (see |BaiRail] l. Amongst the many results in ^BaiRail relating 
combinatorics and random matrix theory, we mention the following. Let S*^"^"* = 
{tt £ Sn ■ TT^ = id} be the set of involutions in Sn- Then, under the Robinson- 
Schensted correspondence, uniform distribution on iS'jj"^'' pushes forward to a new 
measure on Young diagrams, different from the Plancherel measure. Denote this 
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measure by Prob*-'"^-' , and in place of (|51|l we have 

Prob(7r e 5^""^ : In{tt) < n) = FIoh'■"'^^^l h iV : < n). 
In [ BaiRail) the authors show that 

hm ProbfTT e S^r^ : ^""'ff^ < x 
= hm Proh^--U^^^N■.^'~ff'<x) ^''^ 

and for the second row of fi 

hm Prob("-) UhN : ~jf^ < x) = Fp^,{x). (53) 

Here F^=i and F/3=4 are the Tracy- Widom distributions for the largest eigenvalue 
of the GOE and GSE ensemble, respectively (see ((SEll , (|SZI) , ISHl) ) ■ Thus ah three 
of the basic ensembles P — 1,2 and 4 show up in the analysis of the (general) 
increasing subsequence problem. 

A problem which is closely related to the longest increasing subsequence problem 
is the random word problem. In |TraWid4| the authors consider words {oj} of length 
TV in an alphabet of k letters, i.e. maps uj : {1,2, ...,A^} {1,2, ... ,k}. One 
says that uj{ii), . . . ,uj{ij) is a weakly increasing subsequence in to of length j if 

ii < 12 < ■ ■ ■ < ij and < ^(12) < • • • < ^(*j)- Let (w) denote the length 

of the longest weakly increasing subsequence in u. Assuming that all words are 
equally likely, Tracy and Widom in (TraWid4j proved that 



I (lu) — — f 
lim Probfw : ^ ^ < = 7fc / e-^ti^? J| {x^-Xjfdxi---dx 



N^oo \ 2N / Jr ,^ 



k-l 



(54) 



where 
and 



Cs = Hxi, . . . , Xk) ■ max Xi < s, xi + ■ ■ ■ + Xk = 0} (55) 

l<i<k 

Vk2 ^ 

Ik ^ jrr ■ (56) 

It is easy to see that the right-hand side of is just the distribution function for 
the largest eigenvalue of a fc x QUE matrix conditioned to have trace zero. 
Consider the representation of the number tt, say, in any basis b, 

TT — 0.aia2a3 . . . X b"^ , q e Z. (57) 

It has long been believed that in some natural asymptotic sense the digits ai, 02, as, . . 
are independent and identically distributed, with uniform distribution on {0, 1, . . . ,b- 
1}. In an attempt to formalize this notion, E. Borel (1909) introduced the idea of 
normality (see | Wag| ) : A real number x is normal if for any base b, any m > 1, 
and any m-string s, 

#|occurrences of s in the first n base-6 digits of x} 
hm — = b . (58) 
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While it is known that non-normal numbers form a set of Lebesgue measure zero, 
and all numerical evidence confirms H58() to high order, no explicit examples of 
normal numbers are known. 

Relation H54() suggests a new way to test for asymptotic randomness, as follows. 
Consider the first LN base-6 digits aia2 ■ ■ ■ a^jv of a given number x, where L 
and N are "large". Partition these digits into L words loj = a(j_i)7v ••• ajAr, 
I < j < L, each of length N. For each LOj compute l^{ujj). Then if the digits {aj} 
of X are asymptotically random, we could expect that as L, iV — > oo, the empirical 
distribution 

is close to the conditional GUE distribution on the right-hand side of 154|l . Pre- 
liminary calculations in jPeiWitl for x = tt and b = 2 show that for L, N "large" 
the empirical distribution is indeed close to the right-hand side of (|54|l with high 
accuracy. The work is in progress. 

Problem 4. (Bus problem in Cuernavaca) 

Krbalek and Seba found that both the bus spacing distribution and the number 
variance are well modeled by GUE, |j2Hl), and (fTTjl respectively (see Figures 2 
and 3 in jKrbSeb) ). In order to provide a plausible explanation of the observations 
in |KrbSebj , the authors in BBDS) introduced a microscopic model for the bus line 
that leads simply and directly to GUE. 

The main features of the bus system in Cuernavaca are 

(a) the stop-start nature of the motion of the buses, 

(b) the "repulsion" of the buses due to the presence of recorders. 

To capture these features, the authors in |BBDSj introduced a model for the buses 
consisting of n(= # of buses) independent, rate 1 Poisson processes moving from 
the bus depot at time t = to the final terminus at time T, and conditioned not 
to intersect for < t <T. The authors then showed that at any observation point 
X along the route of length N > n, the probability distribution for the (rescaled) 
arrival times of the buses, j/j = ^ — 1 G [—1, 1], 1 < j < is given by 

n 

const. Yl Wj ivj) Y[ ~ yj)^ dyi--- dy^ (59) 

where 

w;,,(2/)-(l + 2;)--i(l-2/)^— -K y < 1. (60) 
Formula H59|l is precisely the eigenvalue distribution for the so-called Jacobi Unitary 
Ensemble (cf. Q with e^^"'^^^^ = wj{y) — weight for Jacobi polynomials on 
[—1,1]). In the appropriate scaling limit, GUE then emerges by universality. The 
authors also compute the distributions of the positions Si, . . . , a;„ of the buses at any 
time t G (0, T). Again the statistics of the described by a Unitary Ensemble, 

but now w J is (|59ll is replaced by the weight for the Krawtchouk polynomials: by 
universality, GUE again emerges in the appropriate scaling limit. 

In an intriguing recent paper, Abul-Magd .Abu, noted that drivers have a ten- 
dency "to park their cars near to each other and at the same time keep a distance 
sufficient for manoeuvring." He then analyzed data measuring the gaps between 
parked cars on four streets in central London and showed quite remarkably that the 
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gap size distribution was well represented by the spacing distribution H28(l . (|29|l of 
GUE. It is an interesting challenge to develop a microscopic model for the parking 
problem in |Abu| . analogous to the model for the bus problem in [BBDSj. 

Problem 5. (Random turns vicious walker model) 

In jBaiRai2) the authors proved that, as ^ oo, djv, the distance traveled by 
the walker starting from 0, behaves statistically like the largest eigenvalue of a GOE 
matrix. More precisely, they showed that 



where -Fi is given by H37|l . In a variant of this model, |For3| . the walkers again start 
at 0,1,2,..., and move to the left for a time N; thereafter they must move to the 
right, returning to their initial positions 0,1,2,... at time 2N. Let d'jy denote the 
maximum excursion of the walker starting from 0. Then Forrester shows that d'j^ 
behaves statistically like the largest eigenvalue of a GUE matrix. 



where F2 is given by H33|) . 

The proofs of (|61|l and (|B2|I rely on the observation of Forrester in |For3| that, 
in the first case, the set of walks is in one-to-one correspondence with the set 
y^-'^) of standard Young tableaux of size N (see |Sag| ), whereas in the second case, 
the variant model, the set of walks is in one-to-one correspondence with the set 
y(^) of pairs {P, Q) of standard Young tableaux of size N with the same shape, 
sh(P) = sh((5). In both cases, djv and d'^ equal the number of boxes in the first 
row of the corresponding standard Young tableaux. Uniform measure on Y'^'^^ (resp 
yd)) 

gives rise to Plancherel measure (resp. Prob*^'"^-*) on Young diagrams of size 
N , and the proof of (|62|) then follows from 14()l) . (I51II . and the proof of (|61|) follows 
from (|^ . 

In p3ai , Baik proved the analogue of 1)61(1 . H62|l for the so-called lock step vicious 
walker introduced in |Fis| . The proof in |Bai| relies in part on an observation of 
Guttmann et al. in |GOV| . which preceded |For3| . that the set of path config- 
urations for the lock step model is in one-to-one correspondence with the set of 
semi-standard Young tableaux (see |Sag| ). 

Problem 6. (Aztec diamond) 

After scaling by n + 1, Jockush et al., |JPS| . considered the tiling problem with 
dominos of size x in the tilted square Tq = {{u^v) : \u\ + \v\ < 1}. 
As n 00, they found that the inscribed circle Cq = {{u,v) : v? + — i}, 
which they called the arctic circle^ plays a remarkable role. In the four regions 
of To outside Co, which they call the polar regions and label N, E, S, W clock- 
wise from the top, the typical tiling is frozen, with all the dominoes in N and 
S horizontal, and all the dominos in E and W vertical. In the region inside 
Co, which they call the temperate zone, the tiling is random. (See, for exam- 
ple, http://www.math.wisc.edu/~propp/tiling, where a tiling with n = 50 is 
displayed.) 

But more is true. In j.Tohlj . |,Toh2j . Johansson considered fluctuations of the 
boundary of the temperate zone about the circle Co. More precisely, for — 1 < a < 




(61) 




(62) 
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l,a^O, let 

V-^Q 1 Wa / V 2 ' 2 /' y-^a^Ua) ~ \ 2 ' 2 / 

denote the two points of intersection of the hne u + v = a with Cq = {u^ + v'^ — i}. 
Then for fixed a, Johansson showed that the fluctuations of the boundary of the 
temperate zone along the line u + v — a about the points (x'^^Ua) and {x~,y~) 
were described by the Tracy- Widom distribution F2 (see |,Toh2| . equation (2.72), for 
a precise statement). Johansson proceeds by expressing the fluctuations in terms 
of the Krawtchouk ensemble (cf. Problem 4), which he then evaluates asymptoti- 
cally as n ^ 00. Such an analysis is possible because the associated Krawtchouk 
polynomials have an integral representation which can be evaluated asymptotically 
using the classical method of steepest descent. In |CLP) . the authors considered 
tilings of hexagons of size n by unit rhombi and proved an arctic circle theorem 
for the tilings as n ^ 00 as in the case of the Aztec diamond. In jJohl| . |Joh2| . 
Johansson again expressed the fluctuations of the arctic circle for the hexagons in 
terms of a random particle ensemble, but now using the Hahn polynomials rather 
than the Krawtchouk polynomials. The Hahn polynomials, however, do not have 
a convenient integral representation and their asymptotics cannot be evaluated by 
classical means. This obstacle was overcome by Baik et al., jBKMMj . who extended 
the Riemann-Hilbert/steepest descent method in DKMVZT| and |nKMVZ2 to a 
general class of discrete orthogonal polynomials. In this way they were able to 
compute the asymptotics of the Hahn polynomials and verify _F2-behavior for the 
fluctuations of the temperate zone, as in the case of the Aztec diamond. 



Problem 7. (Airline boarding) 

In |BBSSSj the authors show that feAr(7r), the boarding time for N passengers 
subject to the protocol (a)(b)(c) in Problem 7, behaves statistically like the largest 
eigenvalue of a GUE matrix. 

The proof of H63(l in j3BSSS relies on the description of the Robinson-Schensted 
correspondence in terms of Viennot diagrams (see |Sag| ). We illustrate the situation 
with the permutation tt : 341562 in 5*6 (cf. Problem 3 and (02)). We say that a 
point {x' , y') lies in the shadow of a point {x, y) in the plane if x' > x and y' > y. 
Plot TT as a graph (1, 3), (2, 4), . . . , (6, 2) in the first quadrant of R^. Consider all the 
points in the graph which are not in the shadow of any other point: in our case (1, 3) 
and (3, 1). The first shadow line Li is the boundary of the combined shadows of 
these two points (see Figure|21l. To form the second shadow line L2, one removes the 
points (1,3), (3,1) on Li, and repeats the procedure, etc. Eventually one obtains 
fcjv(7r) = k shadow hues Li, L2, ■ ■ ■ ,Lk for some integer k. In our example k — A, 
which we note is precisely leiT^), the length of the longest increasing subsequence 
for TT. This is no accident: for any tt € Sn, we always have fcAr(7r) = InIt^) (see 
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2 L, 

1 L, 

12 3 4 5 6 

Figure 3. Shadow lines for tt : 3 4 1 5 6 2 in S'e- 

The beautiful fact is that each shadow line describes a step in the boarding 
process. Indeed, noting the y- values of the Lj^s, we observe that 

Li < — > 3 and 1 are seated 

L2 < — > 4 and 2 are seated 

< — > 5 is seated 

L4 < — > 6 is seated 

Thus 6Ar(7r) = kiy{Tr) — In{t^) — qN{T^), and (jH^ follows from (@BJ. In the language 
of physics, if we rotate the Viennot diagram for tt counterclockwise by 45°, we see 
that the shadow region of a point on the graph is simply the forward light cone based 
at that point (speed of light=l). In other words, for appropriate coordinates a, b we 
are dealing with the Lorentzian metric ds^ = dadb. In order to incorporate more 
realistic features into their boarding model, such as the number of seats per row, 
average amount of aisle length occupied by a passenger, etc., the authors in BBSSS] 
observe that it is enough simply to replace ds^ — dadb by a more general Lorentzian 
metric ds^ — AD'^p{a,b){dadb + ka{a,b)da?) for appropriate parameters/functions 
Z?,p, k and a (see BBSSS , equation (1)). Thus the basic phenomenon of blocking 
in the airline boarding problem is modeled in the general case by the forward light 
cone of some Lorentzian metric. 

Problems 1 and 2 above, as opposed to 3-7, are purely deterministic and yet 
it seems that they are well described by a random model, RMT. At first blush, 
this might seem counterintuitive, but there is a long history of the description of 
deterministic systems by random models. After all, the throw of a (fair) 6-sided 
die through the air is completely described by Newton's laws: Nevertheless, there 
is no doubt that the right way to describe the outcome is probabilistic, with a one 
in six chance for each side. With this example in mind, we may say that Wigner 
was looking for the right "die" to describe neutron scattering. 

Problems 1-7 above are just a few of the many examples now known of math- 
ematical/physical systems which exhibit random matrix type universal behavior. 
Other systems, from many different areas, can be found for example in |Meh| and the 
reviews |TraWid3| . |For2| . and FerPra . A particularly fruitful development has 
been the discovery of connections between random matrix theory and stochastic 
growth models in the KPZ class ( (PraSpo| , jFerPrap . and between random matrix 
theory and equilibrium crystals with short range interactions ( [CerKen] . |FerSpo| , 
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[OkoResj ■ |FerPraSpo| ). In addition, for applications to principal component anal- 
ysis in statistics in situations where the number of variables is comparable to the 
sample size, see jJohn) and |EBPI and the references therein. For a relatively recent 
review of the extensive application of RMT to quantum transport, see |Bee| . 

Returning to Wigner's introduction of random matrix theory into theoretical 
physics, we note that GOE is of course a mathematical model far removed from 
the laboratory of neutrons colliding with nuclei. Nevertheless, Wigner posited that 
these two worlds were related: With hindsight, we may see Wigner's insight as a 
prophetic vision of a scientific commonality across the far borders of physics and 
mathematics. 

5. Section 5 

As is clear from the text, many different kinds of mathematics are needed to 
analyze Problems 3-7. These include 

• combinatorial identities 

• Riemann-Hilbert methods 

• Painleve theory 

• theory of Riemann surfaces 

• representation theory 

• classical and Riemann-Hilbert steepest descent methods 
and, most importantly, 

• random matrix theory. 

The relevant combinatorial identities are often obtained by analyzing random par- 
ticle systems conditioned not to intersect, as in Problem 4. The Riemann-Hilbert 
steepest descent method has its origins in the theory of integrable systems, as in 
|DeiZho| . There is no space in this article to describe the implementation of any 
of the above techniques in any detail. Instead, we refer the reader to |Dei2| . which 
is addressed to a general mathematical audience, for a description of the proof of 
(|46|l in particular, using Gessel's formula in combinatorics |Ges| . together with the 
Riemann-Hilbert steepest descent method. For Problem 2 the proofs are based on 
combinatorial facts and random matrix theory, together with techniques from the 
theory of L-functions, over Q and also (in IKatSarp over finite fields. 

Universality as described in this article poses a challenge to probability theory 
per se. The central limit theorem (jJl above has three components: a statistical 
component (take independent, identically distributed random variables, centered 
and scaled), an algebraic component (add the variables), and an analytic component 
(take the limit in distribution as n — )■ oo). The outcome of this procedure is then 
universal - the Gaussian distribution. The challenge to probabilists is to describe an 
analogous purely probabilistic procedure whose outcome is Fi, or F2, etc. The main 
difhculty is to identify the algebraic component, call it operation X. Given X, if one 
takes i.i.d.'s, suitably centered and scaled, performs operation X on them, and then 
takes the limit in distribution, the outcome should be Fi, or F2, etc. Interesting 
progress has been made recently (see [BodMarj and IBaiSuip on identifying X for 
F2. For a different approach to the results in jBodMar) and [BaiSuij . see |Suij . where 
the author uses a very interesting generalized version of the Lindeberg principle due 
to Chatterjee |Chalj . |Cha2j . 
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Our final comment/speculation is on the space D, say, of probability distribu- 
tions. A priori, D is just a set without any "topography". But we know at least one 
interesting point on D, the Gaussian distribution F^. By the central limit theorem, 
it lies in a "valley" , and nearby distributions are drawn towards it. What we seem 
to be learning is that there are other interesting distributions, like Fi or F2, etc., 
which also lie in "valleys" and draw nearby distributions in towards them. This 
suggests that we equip D with some natural topological and Riemannian structure, 
and study the properties of D as a manifold per se. 
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